Protein Domains of Unknown Function Are Essential in Bacteria
Authors
Abstract
More than 20% of all protein domains are currently annotated as "domains of unknown function" (DUFs). About 2,700 DUFs are found in bacteria compared with just over 1,500 in eukaryotes. Over 800 DUFs are shared between bacteria and eukaryotes, and about 300 of these are also present in archaea. A total of 2,786 bacterial Pfam domains even occur in animals, including 320 DUFs. Evolutionary conservation suggests that many of these DUFs are important. Here we show that 355 essential proteins in 16 model bacterial species contain 238 DUFs, most of which represent single-domain proteins, clearly establishing the biological essentiality of DUFs. We suggest that experimental research should focus on conserved and essential DUFs (eDUFs) for functional analysis given their important function and wide taxonomic distribution, including bacterial pathogens.
IMPORTANCE The functional units of proteins are domains. Typically, each domain has a distinct structure and function. Genomes encode thousands of domains, and many of the domains have no known function (domains of unknown function [DUFs]). They are often ignored as of little relevance, given that many of them are found in only a few genomes. Here we show that many DUFs are essential DUFs (eDUFs) based on their presence in essential proteins. We also show that eDUFs are often essential even if they are found in relatively few genomes. However, in general, more common DUFs are more often essential than rare DUFs.
Similar resources
Discovering Domains Mediating Protein Interactions
Background: Protein-protein interactions do not provide any direct information regarding the domains within the proteins that mediate the interactions. The majority of proteins are multidomain proteins, and the interaction between them is often defined by pairs of their domains. Most earlier studies focus only on interacting domain pairs. However, they do not consider the in...
The homeobox genes are known to play a crucial role in controlling the development of multicellular organisms. Most of these genes have been shown to encode regulatory proteins. These trans-acting factors regulate the expression of proteins that are necessary during developmental processes throughout the body. TGIFLX/Y is a homeobox gene and it cont...
O-5: Identification of Novel Immunodominant Epididymal Sperm Proteins Using Combinatorial Approach
Background: Alteration in the protein signatures of functionally immature testicular spermatozoa occurs during their journey through the epididymis. This leads to the acquisition of sperm domain-specific functions essential for successful fertilization. Epididymal sperm proteins are preferred targets for immunocontraception as well as for elucidating the causes of infertility. The Background of the ...
Solving high-order partial differential equations in unbounded domains by means of double exponential second kind Chebyshev approximation
In this paper, a collocation method for solving high-order linear partial differential equations (PDEs) with variable coefficients under a more general form of conditions is presented. This method is based on the approximation of the truncated double exponential second kind Chebyshev (ESC) series. The definition of the partial derivative is presented and derived as new operational matrices of der...
Molecular Insight into the Mutual Interactions of Two Transmembrane Domains of Human Glycine Receptor (TM23-GlyR), with the Lipid Bilayers
Acting as a computational microscope, MD simulation can "zoom in" to atomic resolution to assess detailed interactions of a membrane protein with its surrounding lipids, which play important roles in the stability and function of such proteins. This study employed molecular dynamics (MD) simulations to determine the effect of added DMPC or DMTAP molecules on the structure of D...